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Abstract. One of the greatest scientific achievements of physics in the 
20th century is the discovery of quantum mechanics. The Schrodinger 
equation is the most fundamental equation in quantum mechanics de- 
scribing the time-based evolution of the quantum state of a physical sys- 
tem. It has been found that the time-independent version of the equation 
can be derived from a general optimization algorithm. Instead of arguing 
for a new interpretation and possible deeper principle for quantum me- 
chanics, this paper elaborates a few points of the equation as a general 
global optimization algorithm. Benchmarked against randomly generated 
hard optimization problems, this paper shows that the algorithm signif- 
icantly outperformed a classic local optimization algorithm. The former 
found a solution in one second with a single trial better than the best one 
found by the latter around one hour after one hundred thousand trials. 



1 Introduction 

Optimization is a core problem both in mathematics and computer science. It is a 
very active research area with many international conferences every year, a large 
amount of literature, and many researchers and practitioners across many fields 
for a wide range of applications. Combinatorial optimization |ll2j is a branch of 
optimization where the set of feasible solutions of problems is discrete, count- 
able, and of a finite size. The general methods for combinatorial optimization are 
1) local search [213] . 2) simulated anneahng |4l5j . 3) genetic algorithms [61718] . 
4) ant colony optimization [5], 5) tabu search [TU], 6) branch-and-bound [11112] . 
and 7) dynamic programming [12] . The successful applications of different com- 
binatorial optimization methods have been reported in solving a large variety of 
optimization problems in practice. 

Cooperative optimization is a newly proposed general global optimization 
method [13114115] inspired by cooperation principles in team playing. Often 
times, individuals working together as a team can solve hard problems beyond 
the capability of any individual in the team. In its normal form, cooperative 
optimization has one and only one equilibrium and converges to it with an ex- 
ponential rate regardless of initial conditions. It is also capable of identifying 
global optimal solutions so that it can efficiently terminate its search process. 



Recently, it has been found (preprint: http: / /arxiv.org/ abs / quant-ph/0605220 1 
that the time-independent Schrodinger equation can be derived from a general 
form of cooperative optimization in a continuous-time version for continuous 
variables. In particular, when the cooperation among the agents is controlled 
at a certain level, the cooperative optimization algorithm becomes the time- 
independent Schrodinger equation when it reaches an equilibrium (a stationary 
state) . Instead of arguing for a new interpretation and possible deeper principle 
for quantum mechanics, this paper elaborates a few points of the equation as a 
general global optimization algorithm, such as the existence of an equilibrium 
and the time-based evolution. 

In the HDTV research lab, at Tsinghua University, Beijing, China, we found 
a noticeable improvement of the algorithm over the belief propagation [16117] . 
the state-of-the-art algorithm, at decoding some modern channel codes (a NP- 
hard problem similar to the integer programming problem). At solving those 
real- world hard optimization problems, classic optimization methods, includ- 
ing simulated annealing and genetic algorithms, failed to deliver a satisfying 
performance. Instead of using the problems biased towards some particular ap- 
plications, this paper conducts the comparison over randomly generated hard 
optimization problems to check the generality of the Schrodinger equation as an 
optimization algorithm. 

2 Cooperative Optimization and Schrodinger Equation 

The fundamental difference between cooperative optimization and many classical 
optimization methods is at the very core of optimization, i.e., the way of making 
decisions for assigning decision variables. Classic one often times make a precise 
decision at assigning a variable at every time-instance of optimization, such 
as X = 3 for a time instance t. Such an assignment is precise at the sense 
that X can only be of the value 3, not any other ones. In contrast, cooperative 
optimization makes a soft decision, represented by a probability-like function 
called an assignment function, such as W{x,t), at every time-instance t. It says 
that at the time instance t, the variable x can be of any value with the likelihood 
measured by the function value ^{x,t). A variable value of a higher function 
value is more likely to be assigned to the variable x than anyone of a lower 
function value. 

If the function 'I'{x, t) at time t is peaked at a specific value, say a; = 3, while 
the others equal to zero, then the soft decision falls back to the classic precise 
decision, e.g., assigning the value 3 to the variable x {x — 3). Hence, the soft 
decision making is a generalization of the classic precise decision making. 

Let E{xi, X2, . . . , Xn), simply E{x), be a multivariate objective function of n 
variables. Assume that E{x) can be decomposed into n sub-objective functions 
Ei{x), one for each variable, such that those sub-objective functions satisfying 

Ei{x) + E2{x) + ... + En{x) = E{x) , 

and/or the minimization of Ei{xi) with respect to Xi also leads to the minimiza- 
tion of E{x) for any i. 
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In terms of a multi-agent system, let us assign Ei(x) as the objective function 
for agent i, for i — 1,2, ... ,n. There are n agents in the system in total. The 
objective of the system is to minimize E{x) and the objective of each agent i is 
to minimize Ei{x). 

A simple form of cooperative optimization is defined as an iterative update 
of the assignment function of each agent as follows: 

[e-^'(-)/'"'n^^^(^^'*-l)) ' for* = l,2,...,n, (1) 

where X^^^,. stands for the summation over all variables except Xi and ?i is a 
constant of a small positive value. Pi{xi, t) is defined as 

p,{x,,t)^{Mx,,t)r /Y,mx.,t)r , (2) 

Xi 

where a is a parameter of a positive real value. 

By the definition, pi[xi,t) just likes a probability function satisfying 

^p^{xr,t) = 1 . 

It is, therefore, called the assignment probability function. It defines the probability- 
like soft decision at assigning variable Xi at the time instance t. 

The original assignment function ^i{xi, t), is called the assignment state func- 
tion. That is, the state of agent i at the time instance t is represented by its 
assignment state function ^i{xi,t). From Eq. [2]we can see that the assignment 
probability function pi{xi,t) is defined as the assignment state function Wi(xi) 
to the power a with normalization. To show the relationship, the assignment 
probability function pi{xi,t) is also expressed as (^i{xi,t)'j in the following 
discussions with the bar standing for the normalization. 

With this notation, the iterative update function ([1]) can be rewritten as 

tf',(a;„t)=^ , for » = 1, 2, . . . , n . (3) 

Without loss of generality, let be the utility function of agent i in 

terms of game theory. It is important to note that when the parameter a is of a 
sufficient value, i.e., a — > oo, the simple form converges to an equilibrium if 
and only if it is also a Nash equilibrium (preprint: http://arxiv.org/abs/0901.3615). 
Nash Equilibrium is arguably the most important concept in game theory, critical 
for understanding a common scenario in game playing. It offers the mathematical 
foundation for social science and economy. 

By substituting Eq.[T]into Eq.[2l we have a mapping from a set of assignment 
probability functions to itself. Because the set is compact and the mapping is 
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continuous, so a fixed point exists based on Brouwer fixed point theorem. Since 
a set of assignment state functions is uniquely defined by a set of assignment 
probability functions by Eq. [1] We can conclude that there exists at least one 
set of assignment state functions if'j (2^2), • ■ • ,^ni^ri)} such that 

In particular, when a = 2, the simple general form of cooperative optimiza- 
tion over the real domain M given in ([3]) can be generalized further over the 
complex domain C in a continuous-time version as follows 

where Zi{t) is a normalization factor such that 

and \^pi{xi,t)\'^ is defined as 

Following the notation from physics, denote ipi{xi^t) and a vector | tpiit)). 
The equation ([4]) can be generalized further as follows 

I Mt)) = -^H, I ^dt)) • (5) 

In the equation. Hi is a hermitian matrix which defines the local energy (ob- 
jective) function for agent i in a more general form. In particular, for the case 
of Q, the hermitian matrix Hi reduces to a diagonal matrix with the diagonal 
elements as 

E I ^j(^)]^ : foi' every value of Xi . 

The function "tjjiixi, t) is also called a wavefunction in physics. It is important 
to note that the equation ([5]) is the dual equation of the Schrodingcr equation: 

where —1 is replaced by the imaginary unit i(— V^l) ^md the normalization 
factor Zi{t) is not required since the equation is unitary, which means that the 
total norm of the wavefunction ipi{xi,t) is reserved, i.e., 



\ 'ipi(xi, i)P = const, for any t. 



4 



When the dynamic equation ([S]) reachs a stationary point (equiUbrium) , it 
has been found out that the equation becomes the time-independent Schrodinger 
equation, one of the most important equations in quantum mechanics. That is 

ei I ipi{xi,t)) = Hi I ^i{xi,t)) , 

where can only be any one of the eigenvalues of H^. 

If the local energy function Hi is time- independent, then the time-based 
evolution of the state function | ipi{t)) is 

I Mt)) = ^f'~'"'' I ^^(0)) • 

Assume that | | 02), . . . , | ^at) are the N eigenvectors of the Hamilitian 
matrix Hi with corresponding eigenvalues as ei, 62, . . . , e„, respectively. Then 
any initial state | ipii^)) can be represented as the linear superposition of the N 
eigenvectors as follows 

I V^(0)) = Ci I 0i) + C2 I (/'2) + • • • + CJV I 0Ar) , 

where c„ (1 < n < and c„ £ C) are coefficients. The state function | ipi{xi,t)) 
at time t is governed by 

1 ^ 

' 3=1 

Benefit from almost a century's mathematical development driven quantum 
physics, we can extend the concept of a hermitian matrix to that of a hermitian 
operator to deal with the case when the variable Xi is continuous. In this case, 
the concept of an eigenvector will be extended to that of an eigenfunction. 

Putting everything together, the pseudo-code of a quantum optimization 
algorithm is shown in Figure [TJ 

3 Experimental Comparisons 
3.1 Constraint Optimization Problems 

A large class of optimization problems has an objective function of the following 
form, 

n 

E{x) =^fi{x^) + ^ J^j{xi,Xj) . (6) 

The function fi{xi) is called an unary function and the function fij{xi,Xj) is 
called a binary function. To note the collection of all defined binary functions, 
the set Af is used which contains non-ordered pairs of variable indices where each 
pair corresponds to a defined binary function fij{xi,Xj). 
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1 begin 



2 for every i, initialize i'i{xi) with random non-negative values and normalize it; 

3 for fc := 1 to maximum JterationjT,umber do 

4 begin 

5 for i := 1 to n do 

6 begin 

7 for each value of Xi do 

8 E... (e-^-<^'/"n,^.'^i fe)); 

9 Let Xi ~ argmaxi. 'I'i{xi); 

10 end; 

11 normalize ^i{xi) such that \ipi{xi)\'^ — 1; 

12 end; 

13 return (ii, X2, ■ ■ ■ , x„) as the final solution; 



14 end; 



Fig. 1. A quantum optimization algorithm 

Minimizing the objective function defined in ([5]) is referred to as a constraint 
optimization problem (binary COP) in AI. The unary function fi{x) is called 
an unary constraint on variable Xi and the binary function {xi , Xj ) is called a 
binary constraint on variables Xi , Xj . 

The constraint optimization problem is a very general formulation for many 
optimization problems arose from widely different fields. Examples are the fa- 
mous traveling salesman problem, the weighted maximum satisfiability problem, 
the quadratic variable assignment problem, stereo matching in computer vision, 
image segmentation in image processing, and many more. The constraint opti- 
mization problem belongs to the class of NP-complete problems. 

An objective function in form of ^ can be represented with an undirected 
graph. In the graph, each variable Xi is represented by a node and each binary 
constraint fij{xi, Xj) is represented by an undirected edge connecting variable 
node Xi to variable node Xj. In this graphic representation, the degree of a node 
is measured by the total number of edges connected to it. 

Each variable node Xi can have a number of neighboring variable nodes. Let 
Af{i) be the set of the indices of the neighboring variables of Xi. By definition, 

m^ = {J\{^,J)eM} . 

Using the notations, we can define a local objective function for agent i as 
Ei{x) ^ fi{xi) + ^ fij{xi,Xj), for i = 1,2, . . . ,n . (7) 

Obviously, minimizing the local objective function Ei(x) with respect to Xi also 
leads to the minimization of the global objective function E{x). 
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3.2 A Classic Local Optimization Algorithm 

Local search is one of the classic methods for attacking hard combinatorial opti- 
mization problems. It also plays a fundamental role at understanding many other 
optimization methods, such as simulated annealing, genetic algorithms, and tabu 
search. Started from a randomly generated initial solution, local search itera- 
tively tries to replace the current solution by a better one in the neighborhood 
of the current solution until no further improvement is possible. 

The improvement of the local search algorithm choosing in the experiments 
is achieved by adjusting the value of each variable while other variables have 
their values fixed. Specifically, assume that the current solution is x and Xi is 
the variable with its value to be adjusted. Its value in the current solution is Xi. 
Let X-i denote all variables except Xi. In this case, the neighborhood is defined 
as {xi^X-i) where Xi G the domain of Xi. The adjustment is achieved by 
finding the best value for Xi, denote as x*, such that 

E(x*) — Ta.iTiE{xi,x-i) , 

If X* 7^ Xi, then replacing Xi by x*. Otherwise, keeping Xi as the value of Xi. 
If X* — Xi and it is true for every variable Xi, then no solution improvement 
is possible by adjusting any variable. In this case, the local search algorithm is 
called reaching a local optimal solution since it is optimal with respect only to 
its neighbors. It terminates the search process and returns the current solution 
X as the final one. 

In particular, if E{x) can be decomposed into n sub-objective functions Ei{x) 
and minimizing Ei{x) with respect to Xi also leads to minimizing E{x), then the 
above optimization problem becomes 

Ei{x*) = min£'j(xi, . (8) 

Xi 

A pseudo-code of the above local search algorithm is given in Figure [H 
The multi-restart local search algorithm simply calls the above local search 
algorithm multiple times and picks the best solution (the best local optimal 
solution) as the final solution. In other words, it discovers multiple local optimal 
solutions and keeps the best one. 

3.3 The Experimental Results 

The size of a randomly generated instance is controlled by the total number of 
variables and the total number of values for each variable. The edges connecting 
to each variable node are randomly selected. If each node is connected to all 
other nodes, then the graph is a full graph. Otherwise, if it is connected to a 
small percentage of other nodes, then the graph is sparsely connected. Therefore, 
we can use the average node degree to control the graphical structure of each 
generated instance. 
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1 begin 



5 randomly generate a solution x; 

6 repeat 

7 local jninimum^f lag := true; 

8 for i := 1 to n do 

9 begin 

10 find X* such that E{xX) ~ miua;. E{xi, x-i); 

11 if x* 7^ Xi then 

12 begin 

13 . — i 1 

14 localjrninimum^flag :— false; 

15 end; 

16 end; 

17 until localjminimum^flag — true; 
20 return i as the final solution; 



21 end; 



Fig. 2. The local search algorithm 



The function values of the constraints fi{xi) and fij{xi,Xj) are uniformly 
sampled from the interval [0, 1]. Because of the randomness, it is hard to apply 
any domain-specific heuristics for the advantage of any optimization algorithm. 

In the first set of experiments, ten instances of the constraint optimization 
problem are generated. Each instance has 121 variables, 50 values for each vari- 
able, and the average node degree is six. Each instance has an enormous number 
of feasible solutions, bigger than the total number of atoms in the universe, 
posing a challenging optimization problem. 

For each of those ten randomly generated instances, the best solution of 
the multi-restart local search algorithm (MRLS) after 100,000 trials is com- 
pared with the solution found by the quantum optimization algorithm (QOA) 
(Fig. [T|) with a single trial. The parameters of the algorithm are set as ?i = 1 
and maximumjiteration^number — 20. The results are given in the following 
table. From it we can see that the quantum optimization algorithm yields better 
results within a much shorter time than the multi-restart local search algorithm. 
In the table, variable^ represents the total number of variables and valued the 
total number of values for each variable. 
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MRLS with 100,000 trials vs QOA with a single trial 

variablc:^=121, vahic#=50, avcragc_nodc_degree=6 



Instance^ MRLS:cost (time) QOAxost (time) improvement 



1 


153.11 


(3001 


sec) 


144.77 


(1.061 sec) 


5.76% 


2 


153.92 


(3067 


sec) 


144.02 


(1.046 sec) 


6.87% 


3 


152.12 


(2994 


sec) 


136.84 


(1.030 sec) 


11.17% 


4 


153.53 


(3072 


sec) 


144.34 


(1.045 sec) 


6.37% 


5 


154.95 


(3013 


sec) 


145.22 


(1.061 sec) 


6.70% 


6 


148.17 


(3012 


sec) 


140.37 


(1.029 sec) 


5.55% 


7 


147.90 


(2905 


sec) 


138.83 


(1.014 sec) 


6.54% 


8 


171.03 


(3275 


sec) 


158.82 


(1.138 sec) 


7.69% 


9 


154.70 


(3013 


sec) 


145.59 


(1.061 sec) 


6.26% 


10 


145.14 


(2972 


sec) 


130.19 


(1.030 sec) 


11.49% 



The following table lists the experimental results with another set of in- 
stances where their sizes are different from the previous ones. The total number 
of variables is inceased to 1001 and the total number of values for each variable 
is reduced to 10. From the table we can see that the quantum optimization algo- 
rithm still outperforms the multi-restart local search algorithm both in quality 
and speed. 

MRLS with 10,000 trials vs QOA with a single trial 

variable#=1001,value#=10,average_node_degree=10 



Instance^^ MRLSxost (time) QOAxost (time) improvement 



1 


3269.97 


(41.6 


sec) 


3102.63 


(1.373 


sec) 


5.39% 


2 


3221.61 


(4126 


sec) 


3084.81 


(1.415 


sec) 


4.43% 


3 


3237.84 


(4179 


sec) 


3090.48 


(1.389 


sec) 


4.77% 


4 


3270.37 


(4134 


sec) 


3159.82 


(1.435 


sec) 


3.50% 


5 


3267.66 


(4113 


sec) 


3109.14 


(1.373 


sec) 


5.10% 


6 


3307.75 


(4108 


sec) 


3204.13 


(1.404 


sec) 


3.23% 


7 


3248.23 


(4073 


sec) 


3153.07 


(1.389 


sec) 


3.02% 


8 


3273.33 


(4077 


sec) 


3146.69 


(1.388 


sec) 


4.02% 


9 


3300.05 


(4117 


sec) 


3188.34 


(1.388 


sec) 


3.50% 


10 


3269.44 


(4126 


sec) 


3141.70 


(1.454 


sec) 


4.07% 



4 Conclusions 

The Schrodinger equation can be converted into a general purpose optimization 
algorithm. The original equation is in a continuous-time version while the one 
presented in this paper is in a discrete-time version so that it can be easily im- 
plemented by computer software or hardware. Benchmarked against randomly 
generated hard optimization problems, the quantum optimization algorithm sig- 
nificantly outperformed a classic multi-restart local search algorithm by several 
orders of magnitude. Together with many other existing optimization methods, 
the quantum optimization can be served as a new method to attack hard opti- 
mization problems. 
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